Optimal Selection of Proportional Bounding Quantifiers in Linguistic Data Summarization
نویسنده
چکیده
Proportional bounding quantifiers like “Between p1 and p2 percent” are potentially useful for expressing linguistic summaries of data. Given p1, p2, existing methods for data summarization based on fuzzy quantifiers can be used to assign a quality score to the summary. However, the problem remains how the optimal choice of p1, p2 in the range 0≤ p1 ≤ p2 ≤ 100% can be established. Moreover, the proposed quality indicators are rather heuristic in nature. The paper presents a method for computing the optimal bounding quantifier which best summarizes the given data. Specifically, the most specific quantifier will be chosen which results in the highest validity score of the summary given a constraint on the the percentage range p2− p1. The method not only assigns validity scores to the quantifiers of interest but also determines the best choice of quantifier in O(N logm) time, where N is the size of the base set and m the number of different membership grades in the fuzzy arguments.
منابع مشابه
Fuzzy Quantifiers for Data Summarization and their Role in Granular Computing
Data summarization is an enabling technique of Granular Computing, because of its promise to abstract from individual observations and to view a phenomenon as a whole. The linguistic summaries are built around a fuzzy quantifier which functions as the ‘summarizer’. Linguistic data summarization therefore presupposes an underlying model of fuzzy quantifiers, which is of crucial importance to the...
متن کاملWorking Papers of the IJCAI-2013 Workshop on Weighted Logics for Artificial Intelligence
Quantifiers have the ability of summarizing the properties of a class of objects without enumerating them. This talk introduces a framework for modeling quantifiers in natural languages in which each linguistic quantifier is represented by a family of non-additive measures, and the truth value of a quantified proposition is evaluated by using Sugeno’s integral. Some elegant logical properties o...
متن کاملProtoforms of Linguistic Database Summaries as a Human Consistent Tool for Using Natural Language in Data Mining
We consider linguistic database summaries in the sense of Yager (1982), in an implementable form proposed by Kacprzyk & Yager (2001) and Kacprzyk, Yager & Zadrożny (2000), exemplified by, for a personnel database, “most employees are young and well paid” (with some degree of truth) and their extensions as a very general tool for a human consistent summarization of large data sets. We advocate t...
متن کاملارائه سیستم خلاصه ساز متون فارسی برمبنای ویژگی های زبان شناختی و رگرسیون
Considering the vast amount of existing written information and the shortage of time, optimal summarization of books, articles, news reports, etc. on the Web is a major concern of researchers. In this paper, we propose a new approach for Persian single-document Summarization based on several linguistic features of text. In our approach after extracting the linguistic features for each sentence,...
متن کاملAn extended intuitionistic fuzzy modified group complex proportional assessment approach
Complex proportional assessment (COPRAS) methodology is one of the well-known multiple criteria group decision-making (MCGDM) frameworks that can focus on proportional and direct dependences of the significance and utility degree of candidates under the presence of mutually conflicting criteria in real-worldcases. This studyelaboratesa newintuitionistic fuzzy modified group complex proportional...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006